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^5 ' Abstract 

CN I With Gfe (n) the number of functions of n boolean variables defin- 
able by fc-SAT formulae, we prove that 6*3(71) is asymptotic to 2"^v3J. 

I— I. This is a strong form of the case fc = 3 of a conjecture of Bollobas, 

^^ ' Brightwell and Leader stating that for fixed k, log2 Gk{n) ^ (J!). 

^ ' 1 Introduction 

a ; 

^ \ Let Xn = {xi, . . . , Xn} be a collection of Boolean variables. Each variable x 
is associated with a positive literal, x, and a negative literal x. Recall that a 

T— I ■ k-SAT formula (in disjunctive normal form) is an expression ^ of the form 
>• 

^. Civ---vc,, (1) 
00 ■ 

^v^ ■ with t a positive integer and each C, a k-clause; that is, an expression 

lX '• yi A • • • A yfc, with yi,- ■ ■ ,yk literals corresponding to different variables. 

(^ ' A formula ([1]) defines a Boolean function of xi, . . . , a;„ in the obvious way; 

^^ ■ any such function is a k-SAT function. Though we will be concerned here 

• . , almost exclusively with the case A; = 3, we leave the discussion general for 

.5^ I the moment. 

^ ' Following [3], we write Gk{n) for the number of A;-SAT functions of n 

variables. Of course Gk{n) is at most exp2 [2^^ (^) ] , the number of /c-SAT 

formulas; on the other hand it's easy to see that 



^ 



Gfc(n) >2"(2ifci -n2U J) ~2"+UJ (2) 

(all formulas obtained by choosing yi G {xj, Xj} for each i and a set of clauses 
using precisely the literals yi, . . . , y„ give different functions). 
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The problem of estimating G3(n) was suggested by Bollobas, Brightwell 
and Leader [3]. They showed 

Gfc(n)<exp2[(2V^)(^)], (3) 

for k < n/2 and conjectured that 

log,G,in)<{l + o{l)){l). (4) 

for any fixed k. Even k = 2 is not easy; here @ was proved in ^, and the 
precise asymptotics — 

G2(n) ~ exp2[n+ (2)] (5) 

— conjectured in [1] were proved in [1] and (later) in [TT]. As is often the 
case, nothing from this earlier work seems to be of much help in treating 
larger k. 

Here, for A: = 3, we prove Q and more, again showing (as in (j5])) that 
([2]) gives the asymptotics not just of logG3(n), but of G^{n) itself: 

Theorem 1.1. G^{n) ~ 2"+(3). 

For a formula "^ as in ([1]) we may identify the associated function, 
say /f^, with the set (henceforth also referred to as a "fc-SAT function") 
fI'^) Q {0,1}" of satisfying assignments for "^ (that is, FC^) = fc^^{l)). 
For our purposes it will also usually be convenient to think of '^ as the 
set {Ci, . . . ,Ct} of clauses. Then F(^') C F^^) whenever <r' C ^, and 
we say ^ is irredundant if it is a minimal formula giving -F('^); that is, if 
FC^O C F(^) for each 'if' C "^. Of course each 3-SAT function F corre- 
sponds to at least one irredundant '^, so that, with I{n) = Iz{n) denoting 
the number of irredundant formulas on X„, Theorem 11.11 is contained in 

Theorem 1.2. I{n) ~ 2"+(3). 

This (together with ([2])) says that in fact most F's admit only one ir- 
redundant formula. We regard this simple idea as one of the keys to the 
present work: it allows us to forget about functions and work directly with 
formulas, which are easier (though to date still not easy) to handle. 

Notice that "^ is irredundant iff for each C ^ ^ there is some (not 
necessarily unique) witness wq G {0, 1}" that satisfies C but no other clause 
in 'if (i.e. wc E F{'Tf) \ F{'ia \ {C})). Such witnesses will be central to our 
analysis. For the rest of this paper, we use "formula" to mean "irredundant 
formula" (but we will still sometimes retain the "irredundant" for emphasis). 



We feel sure that the analogues of Theorems 11.11 and 11.21 hold for any 
fixed k in place of 3; that is (with /^(n) the number of irredundant /c-SAT 
formulas of n variables), we should have 

Conjecture 1.3. For each fixed k, Gk{n) ~ Ik{n) ~ 2"'^vfc''. 

While we do think it should be possible to prove this along the present 
lines, the best we can say for now is that our argument can probably be 
generalized to reduce Conjecture II .31 to a finite problem for any given k; see 
the remarks following Corollary 16.31 For example, at this writing we are 
pretty sure we could do A: = 4; but as this doesn't contribute anything very 
interesting beyond what's needed for A; = 3, it seems not worth adding to 
the present, already very long argument. 

On the other hand, if we retreat to A; = 2 then much of the present proof 
evaporates — in particular hypergraph regularity becomes ordinary Szemeredi 
regularity — leaving perhaps the easiest verification of ^ to date. (Of 
course — if one cares — anything based on regularity must give far slower con- 
vergence than the argument of |llj.) 

From now on we will be concerned only with the case k = 3, and will 
say "clause" for "3-clause," "formula" for "(irredundant) 3-SAT formula," 
and so on. Let us try to say what we can about the proof at this point. The 
argument proceeds in two phases. The first of these — which, incidentally, 
gives the asymptotics of log I{n), though the proof doesn't need to say this — 
is based on the Hypergraph Regularity Lemma (HRL) of P. Frankl and V. 
Rodl [8] , a pioneering extension to 3-uniform hypergraphs of the celebrated 
(graph) Regularity Lemma of E. Szemeredi [IT]. (See e.g. [E], [9] for more 
on the spectacular recent developments on this topic.) 

A mild adaptation of some of the material in |8j shows that each irre- 
dundant ^ is "compatible" with some "extended partition" V* (defined in 
Section [3|). On the other hand we show — this is Lemma |3. 11 the upshot of 
this part of the argument — that the set of "^'s compatible with V* is small 
unless V* is "coherent." Since the number of 7^*'s is itself negligible relative 
to what we are aiming at, this allows us to restrict our attention to '^'s 
compatible with coherent V*^s. 

Coherence of V* turns out to imply that there is some z G {0, l}" so 
that for any '^ compatible with V* every witness for '^ mostly agrees (in the 
obvious sense) with z. Once we have this we are done with V* and the HRL, 
and, in the second phase, just need to bound the number of "^'s admitting 
a z as above, so for example the number of 'i^'s for which every witness 
is at least 99% zeros (note we expect that a typical such '^€ uses mostly 



positive literals.) While this can presumably be handled as a stand-alone 
statement, we instead give a recursive bound (see (j20p ) that includes minor 
terms involving earlier values of /. 

The paper is organized as follows. Section [2] fills in what we need from 
hypergraph regularity. Once we have this we can, in Section [31 make the 
preceding mumble concrete and complete the proof of Theorem 11.21 assuming 
various supporting results. These are proved in the remaining sections: after 
some preliminaries in Section UJ Sections [5] and [6] implement the first part 
of the above sketch (proving Lemma 13. ip ; the easy Section [7] then produces 
the above-mentioned z associated with a coherent V*; and the final part of 
the argument (proving (f20]l ) is carried out in Section [8l 

Usage 

Throughout the paper we use log and exp for log2 and expg, and H for 
binary entropy. We use "x = 1 ± y" for "x G {1 — y,{l + y))." With the 
exception of (i22]l (in Section [3]) we always assume that n is large enough to 
support our assertions. Following a common abuse, we usually pretend that 
all large numbers are integers, and, pushing this a little, we will occasionally 
substitute, e.g., "at most a" for "at most a + 1" in situations where the extra 

1 is clearly irrelevant. 

2 Regularity 

In this section we recall what we need from [8j and slightly adapt what they 
do to our situation. Our notation follows theirs as much as possible. 

For a bipartite graph G = {AU B,E), A' O A and B' C B, the density 
of the pair {A', B') is 

d{A',B') = dG{A',B') = \EiA',B')\/i\A'\\B'\) 

(where E{A',B') is the set of edges joining A' and B'). In particular, the 
density of G is d{A,B). The graph G (or the pair (A,B)) is e-regular if 
\d{A',B') - d{A,B)\ < e for ah A' Q A and B' (^ B with \A'\ > e\A\ and 
\B'\ >e\B\. 

For a set V write [y]^ for the collection of 2-element subsets of V . An 
{l,t,ei,e2)-partition V of [F]^ consists of an auxiliary partition 

y = l/o U Fi U • • • U Fi (6) 

with I Vol < t and |Vi| = • • • = |V(| =: ?7i, together with a system of edge- 
disjoint bipartite graphs 

P^. I<i<j<t,0<a<lij<l, (7) 



satisfying 

(a) U^^pH = K{Vi, Vj) := {{x, y} : x e Vi,y G V^} V 1 < i < j < t, and 

(b) all but at most £1(2)771^ pairs {vi,Vj}, Vi G Vi, Vj ^ Vj, 1 < i < j < t, 
are edges of e2-i'egular bipartite graphs Pa ■ 

A partition V as above is equitable if for all but at most £1(2) pairs i,j, 
with 1 < i < j < t, we have 

\P'^^\ < £im^ 

and 

\dp^j{V„Vj) - r^\ < £2 VI < a < kj. (8) 

Note this implies {1 + £2l)~^l < kj (< /), so in fact 

kj = I (9) 

if £2 < ^^^) as will be true below. 

It will be convenient to refer to Vi, . . . ,Vt (but not Vq) as the blocks of 
V and to the Pa^s with a > as the bundles ofV. 

From now on we take V to be Xn, our set of Boolean variables. For the 
following definitions we fix a partition V as above. To simplify notation we 
will often use A,B,C and so on for blocks of P. A triad of P on a triple of 
(distinct) blocks {A,B,C) is P = Pabc = (Pab , Pbc , Pac) , with Pab one 
of the bundles of V joining A and B, and similarly for Pac and PbcO A 
subtriad of such a P is then Q = {Q ab ■, Q BC ■, Q Ac) with Qab ^ Pab and 
so on. Since we are fixing V for the present discussion, in what follows we 
will usually drop the stipulation "of P." 

A triangle of a triad P as above is a triangle in the graph with edge set 
Pab U Pac U Pbc (usually designated by its set of vertices). We write T{P) 
for the set of such triangles and t{P) for \T{P)\. Triangles of a subtriad Q 
and T{Q), t{Q) are defined similarly. 

For a triad P on blocks A, B, C, a pattern on P is tt : {^4, B, C} — )• {0, 1}. 
We interpret this as associating a preferred literal, 7r(x), with each (variable) 
X £ AU B U C; thus, for example, for a G A, n{a) is a if 7r{A) = 1 and a 
if '7t{A) = 0. We also write 7r(a,6, c) for the clause 7r(a)7r(6)7r(c) := 7r(a) A 



*This usage differs slightly from that in fSj, in which triads of V may also use Pq"* 's; the 
change is convenient for us and of course does not affect Theorem 12.11 (formally it makes 
the theorem a bit weaker). 



7r(6) A 7r(c) (where a G A,b £ B and c G C); such a clause is said to belong 

to TT. 

Remark. Of course we could just define patterns directly on triples of blocks, 
but the current definition will turn out to be less troublesome. Note that, 
as above, we will often give the blocks of triad P as an ordered triple, which 
allows us to write, e.g., vr = (1, 1,0) without ambiguity. 

Now fix an (irredundant) formula ^, again regarded as a set of clauses. 
For a triad P and pattern vr on P, we set 

T^ = T'f = {{x, y, z] G T{P) : 7r(x, y, z) G <^}, 



and for the analogue for a subtriad Q of P use T-^{Q). Define the density of 
n to be 

d^ = d'^ = \T^\/t{P). (10) 

For a pattern vr on triad P, integer r, and r-tuple Q = (<3(1), • • • ,Q{r)) of 
subtriads of P, set 



dAQ) 



Ul^,T{Q(s))\ 



We say P is {5, r, ■K)-regular for ^ if for every Q as above with | U^^^ 
T{Q{s))\ > St{P), we have |d^(Q) - (i^| < 6, and {6,r)-regular for '^ if it 
is ((^, r, 7r)-regular for each of the eight patterns tt on P (and ((5, r)-irregular 
otherwise). 

Finally, V is {6, r)- regular for 'if if 

^{t(P) : P is a {S, r)-irregular triad of V} < 5n^. (11) 

Let us emphasize that in the above discussion, the quantities subscripted by 
TT, as well as the definitions of regularity for triads and partitions, refer to 
the fixed ^. 

Theorem 2.1. For all 6, £i with < ei < 26^ and integers to CLf^d lo, and 
for all integer-valued functions r = r(t, /) and decreasing functions 62 = £2(0 
with < £2(0 — ^~^; there are Tq, Lq and Nq such that any formula 'if on 
Xn, with n > A'^O; admits a [5, r) -regular, equitable {l,t, £1,62) -partition V 
for some t and I satisfying to < t <Tq and Iq < I < Lq. 

Proof. This is given by the proof of Theorem 3.11 in [8J (which is the 
same as the proof of Theorem 3.5 beginning on page 151), with some minor 



modifications at the outset. We just indicate what these are, omitting a 
couple definitions that are obvious analogues of their counterparts above. 
We use the initial equitable {lo,to, ei, e2{l))-paTtition Vq (which is defined 
without reference to any hypergraph) to specify hypergraphs T^i, . . . ,7^8) as 
follows. Suppose the blocks of Vq are Vi,...,VtQ. For vr = (7ri,7r2,7r3) G 
{0,1}'^ and X ^ Vi, y ^ Vj and z G V^ with i < j < k, set iT{x,y,z) = 
'^i{x)TT2iy)'n-3{z) (= vri(x) A 7r2(y) A iTsiz)), where 

^(^ \ X if Vi = 0, 

and similarly for ip2{y) and ip3{z). Then let tt"^, . . . ,7r^ be some ordering of 
{0,1}'^, and for s G [8] and x,y,z as above, let {x,y,z} £ T-Lg if (and only 
if) 7r''(x,y,z) G '^. 

The (only) point here is that by starting this way we guarantee that 
clauses belonging to the same pattern in our eventual partition will corre- 
spond to edges of the same T-ig: Theorem 3.11 of [S] gives a partition V as in 
our Theorem 12.11 in which regularity with respect to ^ is replaced by regu- 
larity with respect to each of "Hi, . . . , 11% (which we will not define). But for 
any triad P oiV and pattern vr on P, {6, r, 7r)-regularity for '^ is the same as 
(5, r)-regularity of P (again, we omit the definition) for the appropriate Tis, 
and we are done. (To be unconscionably picky, we should slightly adjust 6, 
since bounds corresponding to (llip for the Tig^s will turn into a bound 86n^ 
for ^.) 



Final remark. In applying Theorem 12. II it will be convenient to require that 
in fact 

lij=l yi,j. (12) 

As noted in ([9]) this is automatically true for i,j satisfying ([8]) (again, assum- 
ing £2 < l~'^ which will be true below); while the assumption (equitability) 
that all but £1(2) pairs i,j do satisfy dSD allows us to arbitrarily modify the 
partitions of the remaining K{Vi,Vjys — we just replace them with parti- 
tions satisfying (|12|) — without significantly affecting (|11|) . (So, to be overly 
precise, we get this very slightly strengthened version of Theorem 12.11 by 
applying the original with a slightly smaller 6. Of course the message here 
is that pairs failing ^ are essentially irrelevant; indeed the only point of 
(|12p is that it makes some things a little easier to say in Section [6l) 



3 Skeleton 

In this section we give enough in the way of additional definitions to allow 
us to state our main lemmas, and give the proof of Theorem II .21 modulo the 
much longer proofs of these supporting results. 

We will soon need to say something concrete about our many param- 
eters, but defer this discussion to the end of the present section. Given 
6,£i,tQ, lQ,r = r{t, I), 82 = ^2(0' ^^^ associated Tq,Lq as in Theorem 12. H de- 
fine an extended partition V* to consist of an equitable (/, t, ei, e2)-pai'tition 
V, with t £ [to,To], / G [/o,^o]) together with 

(a) a set IZiV*) of triads of V that (i) includes no P for which some two 
blocks of P violate ^ or some bundle of P violates e2-regularity, and (ii) 
satisfies 

^{t(P) : P a triad of V not in 1Z{V*)} < 26n^ (13) 

(we will mostly ignore triads not in 1Z(V*))] and 

(b) a value d^ = d^* G {0,t(P)"\ . . . , (t(P) - l)t(P)^\ 1} for each pattern 
TT on some P G TZ{T'*). 

We will call the triads in 1Z(V*) the triads of V* . The bundles of V* 
are those e2-i"egular bundles Pa of V for which the pair {i,j} satisfies dH) 
(so the bundles of V that we allow in triads of V*). A triangle ofV* is a 
triangle belonging to some triad of V* . Say vr is a pattern of V* if it is a 
pattern on some triad of V* and 

dn > 2do, (14) 

where do will be specified below. A clause ofV* is then a clause belonging 
to a pattern of V*; we use K,{V*) for the set of such clauses. 

Say a formula '^ and V* are compatible (written ^ ~ P*) if every triad 
P of V* is ((5, r)-regular for '^, and has dj = d^r for each pattern vr on P. It 
follows from Theorem 12.11 that (for large enough n) every ^ is compatible 
with some V* . (The extra "2" on the right hand side of (J13p covers triangles 
involving pairs {i, j} violating ([8]).) We say V* is feasible if it is compatible 
with at least one '^ and in what follows always assume this to be the case. 
Set 

Ar*(p*) = |{'r :^~P*}|. 

We use A'^* here because we will later work mostly with 

AA(P*) = {'^n/C(P*) :<^~P*} and N{V*) = \M{V*)\. 



Say a triad P of V* is proper if it supports a unique pattern of V* — 
always denoted Tip — and cItt > 1/3. Say / : {blocks of V} — )■ {0, 1} and P 
agree if P is proper and iipiA) = f{A) for each block A of P. Finally, say 
V* is coherent if there is an / as above such that (with ^2 discussed below) 

all but at most C2{q}1^ triads ofV* agree with f. (15) 

The longest part of our argument is devoted to proving, for C2 and all of 
the preceding parameters as described below. 

Lemma 3.1. // 

logiV*(P*)>(l-C2)(^) (16) 

then V* is coherent. 

The argument then proceeds as follows. Fix 5, ei, to, /g, r, 82 (again, see 
below for settings; note r and £2 are functions). As noted above. Theorem 
12.11 implies that each (irredundant) ^ is compatible with some extended 
partition V* . The number of possiblities for V* is, for large enough n, less 
than (say) exp[(logLo)n^]. (There are, very crudely, at most: Tq choices 
for the partition {Vt}; exp[(logLo)(2)] for the bundles Pa ', and exp[(l + 
81ogr7i^)( 3°)Lq] for IZiV*) and the dyr's.) Combining this with Lemma [3T] 
we have, for any constant d < C2 and large enough n. 

Corollary 3.2. All but at most exp[(l — c')(Tj] irredundant 'tf's satisfy 

^ ~ P* for some coherent V* . (17) 

We next need a bound on the number of "^'s that do satisfy pT|) . Define 
the multiplicity, m{y) = m^g{y), of the literal y in ^ to be the number 
of clauses of ^ containing y. Say "^ is positive if m{x) > m{x) for each 
variable x. The 'P*'s will disappear from our argument once we establish 
(with C, again TBA) 

Lemma 3.3. If^ is positive and '^ ^ V* for some coherent V* , then 

any witness w for any clause in ^ has fewer than Qn 1 's. (18) 

The easy proof is given in Section [71 

Write X* for the collection of (irredundant) positive ^'s satisfying (llSp . 
According to Corollary 13.21 and Lemma 13.31 we have 

I(n) <exp[(l- 00(3)] + 2"|X*|. (19) 



In Section [8] we will show, for large enough n and an appropriate positive 
constant c, 

\I*\ < 2(3)+exp[(l-c)(^)]/(n-l) 

+ exp[(l - c)3Q)]/(n - 3) + exp[(;;) - en]. (20) 



The proof of Theorem 11.21 is then completed as follows. Combining ()19p 
and (|20p and setting B{n) = 2" vsJ^ we have (again, for large enough n) 

I{n) < (l+exp[-c'n])5(n) + exp[(l-c')(2)]^(n-l) 

+ exp[(l-c')3(^)]/(n-3) (21) 

(where the change from c to c' takes care of some factors 2" and allows us 
to absorb the first term on the r.h.s. of (|19p in the term exp[—c'n]B{n)). 

We show by induction that ()2ip implies that, for some constant A and 
all n, 

I{n) < (1 + A • 2-^'")5(n) (22) 

(which proves Theorem II. 2p . 

For (j22]) . choose no large enough so that ([2T]) holds for n > no, and then 
choose A > 2 (say) so that (1220 holds for n < uq. Assuming (I22p holds 
up to n — 1 (> no), we have (omitting the little calculation for the second 
inequality) 

I{n)-B{n) < 2-^'"S(n)+exp[(l-c') (2)1(1 + A2-^'("-i))S(n-l) 

+ exp[(l - c')3(^)](l + A2-^'(«-3))S(n - 3) 
< {2-c'" + exp[(-c'(^) + n](l + A2-^'("-i))}5( 

This gives (I22p for n. 



Parameters 

Before proceeding we should say something about relations between pa- 
rameters. Our task in Section [8] is to prove (j20p with some positive c. This 
requires an upper bound on the C produced by Lemma 13.31 (see (i68]l and 
()69p . which involve some additional parameters), which in turn, via Lemma 
3.31 forces (2 in (flSl) to be small (namely it should satisfy (f67l) ). 

Of course for Lemma |3. II to hold, we then need C2 to be small. Specific 
requirements (which, for whatever it's worth, can be satisfied e.g. with C2 
some smallish multiple of Cf) ^^^ given in Section [6] (see (I50p -()52p). These 
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n 



again involve some auxiliaries, mainly Ci and ci, which play roles in Lemma 
16.41 analogous to those of C,2 and C2 in Lemma 13. li (The subscripts are 
arranged in this way because we think of Qi and ci as appearing earlier 
in the argument, Lemma 16.41 being the final intermediate step in the proof 
Lemma 13. II ) 

We then take do to be small compared to C2 (the smallest of the preceding 
parameters), and all of 5,ei,tQ ,Iq small compared to do (where "small" 
means small enough to support our arguments; here we won't spell out the 
requirements, but it will be clear as we proceed that there is no difficulty in 
arranging this). Though unnecessary, it will be slightly convenient to set 

6 = tQ^ = r^ (23) 

(but we retain the names to preserve the flavor of Theorem 12. ip . Finally, we 
take r (= r{t,l)) = l^ and £2 (= ^2(0) = l~^^ ■ (The value of r is needed in 
Section [5] and then the rather severe value of E2 is dictated by Lemma 14.71 
(whose h will eventually turn into r).) 

We will use the usual asymptotic notation a = 0{/3), even when a and /3 
are themselves (usually very small) constants, the interpretation being that 
a < C/3 for some C that could be fixed in advance of any of our arguments. 
But we will also sometimes use inequalities with explicit constants, where 
this seems to make the exposition clearer. 

4 Basics 

Here we collect some general observations, first (Section 14. ip for regular 
graphic partitions, and then (Sections 14.21 and 14. 3p for feasible V*'s. These 
will be used in establishing, in Section [5l limits on legal configurations of 
patterns, the technical basis for the proof of Lemma 13.11 We begin with 
some 

Conventions. 

From this point through the end of Section [6] we fix a feasible V* (for 
which we will eventually prove Lemma 13. ip together with some ^ ~ P*. 
Triads, clauses and patterns are then understood to be triads, clauses and 
patterns ofV*, and we will drop the latter specification. 

As noted above. Section [^T] deals only with graphic aspects of V*, so 
does not really require feasibility. Most of the remaining sections do require 
feasibility, and it is to make use of this assumption that we need '^; that is, 
we are not really interested in ^ itself at this point, but only in the impli- 
cations for V* that can be derived from its compatibility with 'if. For the 
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duration of this discussion (that is, through Section [6]), notation involving 
patterns (e.g. T^) and choices of witnesses will always refer to '^ . 

We will also assume, here and in Section [5l that we have fixed a bundle 
Pa ofV* for any pair of blocks {Vj, V^} used by some triad involved in our 
discussion; thus if two of these triads share a pair of blocks, then they use 
the same bundle from this pair. The bundles and triads under discussion 
may then by specified by their blocks: for simplicity we will usually rename 
blocks A, B,C, . . . and use Pab for the (fixed) bundle joining A and B and 
Pabc for the triad on {A,B^C}. To avoid repeated specification, we will 
always take a,ai ^ A and so on. 

We will also adopt the following abusive but convenient notation. For 
blocks A, B,C and X C A, Y C B, Z C C, we will write XY for the set 
of edges of Pab joining X and Y, and XYZ for the set of triangles of the 
subtriad {XY,XZ,YZ) of Pabc- 

Finally, for a graph G on V, Y (^ V and xi, . . . ,Xk G V \Y, we set 
Y{xi, . . . ,Xk) = {y £ Y : y ^ Xi \/ i £ [k]} (where, as usual, x ~ y means 
xy G E{G)). 

4.1 Decency 

We first need a few easy consequences of graphic regularity, beginning with 
the following basic (and standard) observation (see e.g. Fact 1.3 in [13j). 

Proposition 4.1. If {A, B) is e-regular with density d, then for any B' Q B 
of size at least £\B\, 

\{aeA: \B'{a)\ / {d±e)\B'\}\ < 2e\A\. 

Now suppose that Yi, . . . ,Yk are (distinct) blocks of V* and, for 1 < i < 
j < k, Pij is a bundle of V* joining Yi and Yj (so in particular Pij is £2- 
regular with density l~^±e2)- For distinct xi, . . . , Xs E UYi and Yj{xi : i £ I) 
defined by the Pjj's, say {xi, . . . , Xg} is decent (with respect to Yi, . . . , Y^ 
and the Pjj's, but we will drop this specification when the meaning is clear) 
if for all/C [s], 

\Yj{xi : i G 1)1 = (/-I ± 2e2)'^'m (= (1 ± 2e2lYml~') 

whenever the left side is defined; that is, whenever Xi ^ Yj \/ i £ I . 
The next easy observation is similar to, e.g., (TS", Fact 1.4]. 
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Proposition 4.2. With notation as above, if s is fixed and {xi, . . . ,Xs} is 
decent, then for any u G [k], 

|{x G Yu : {xi, . . . ,Xs,x} is indecent}] < 2^^ ke2m. 

(Actually we will always have A; < 4, but it is no harder to give the general 
statement. In fact s need not be fixed: we just need {l~^ — 2s2y > £2- It may 
also be worth noting that the constant 2^~^^k can always be improved; but 
all we ever really need from Proposition 14.21 is a bound of the form 0{e2Tn), 
so there's no reason to be careful here.) 

Proof, li X £ Yu and {xi, . . . ,Xs,x} is indecent, then there are j £ [k] \ {u} 
and / C [s] such that Xi ^ Yj \/ i £ I and 

\Yj{x) n Yj{xi :i£l)\^ {r^ ± 2e2)\Y,{xi : i £ I)\. 

But by Proposition 14.11 (using \Yj{xi : i £ I)\ > {l^^ — 2s2)^m > £2'm), the 
number of such x's for a given j and / is less than 2£2m. 



In line with the conventions given at the beginning of this section, we 
will in what follows always assume that "decency" refers to the set of blocks 
under discussion, and will tend to drop the specification "with respect to 

From now until the end of Section 14.21 we work with blocks A,B,C, 
employing the conventions discussed earlier and setting P = Pabc- The 
following definitions are given with A, B, C in particular roles, but of course 
are meant to also apply when these roles are permuted. Set (for a £ A) 

L{a) = Lp{a) = {be : {a, b, c} £ T{P)} 

{L for "link"), and, similarly, for an edge ab, 

L{ab) = Lp{ab) = {c : {a, 6, c} £ T{P)] 

(where, recall, we assume a £ A and so on). 

The next proposition, in which decency is with respect to A,B,C, is 
immediate from the definitions 

Proposition 4.3. (a) // a is decent then \L{a)\ = (1 it 2e2l)^rn'^l~^ ; 
(b) If ab is a decent edge, then \L{ab)\ = (1 ± 2e2l)^ml~^; 
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Finally, we need to say something about triangle counts (compare e.g. 
[g Fact A, p. 139]): 

Proposition 4.4. If X,Y,Z are subsets of A,B,C (resp.) with each of 
\Y\, \Z\ at least (1 — 2e2l)^'^£2l'ni, then 

(1 - ^)(1 - 2e20^I^II^II^K"^ < \XYZ\ < \X\\Y\\Z\l-^ + Sesm^. 

In particular, 

(1 - l£2lWl~^ < t{P) < (1 + be2l^Wl'^ 

Proof. Lower bound: There are at least \X\ — 2e2m = (1 ^p)|X| a's 

in X with \Y{a)\ > (1 - 2e2l)\Y\l-^ and \Z{a)\ > (1 - 2e2l)\Z\l-\ and for 
each of these a's we have (now fully using the lower bounds on |y| and \Z\) 
\Y{a)Z{a)\ > {l-2e2l)\Y{a)\\Z{a)\l-\ 

Upper bound: There are at most 2e2m a's with |y(a)| > (1 + 26201^1^""'^ 
or |.Z'(a)| > (l + 2e20l-^l^~"^ (or both) , while for any a we have \Y{a)Z{a)\ < 
m8ix{{l + 2e2l)\Y{a)\\Z{a)\l-'^,e2m'^}. This gives (crudely) 

\XYZ\ < {l + 2e2lf\X\\Y\\Z\r^ + Ae2m^. 



4.2 Triads 

We continue to work with blocks A,B,C and P = PabCj and now fix a 
pattern vr on P. Note in particular that "decency" in this section is with 
respect to these three blocks (and P). Set (e.g.) 

L^(a) = {bc: {a,b,c} £ T^} 

(where, recall, T^r is T^ for our fixed ^) and, for an edge ab, 

L^'iab) = {c: {a,b,c} G T^}. 

Say a is good for vr — or for now simply good — if, with 5i = Vo, 

(i) a is decent, and 

(ii) for any Bi, . . . ,Br ^ B{a) and Ci, . . . ,Cr C C{a), if the edge sets 
Gs := BgCs satisfy | U^^i Gs\ > dirn^l^^ , then 

|L-(a) n (U^=iG.)| = (d. ± 5)\ u:=i Gs\. (24) 
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(Note that (ii) implies the formally more general statement where the num- 
ber of Bi's and Cj's is at most r, since we can add some empty sets to the 
list.) 

For a good o, say b £ B{a) is nice to a (with respect to n, but again 
we'll drop this specification) if {a, b} is decent and 

|L''(a6)| = {d^ ± 25)ml-'^ (25) 

An edge ab is then good if a and b are good and nice to each other. A 
triangle {a,b,c\ is good if its edges are all good and great if it is good and 
belongs to T^^. Finally, we say a vertex is great if it belongs to at least 
6qvi?1~^ great triangles and an edge is great if it belongs to at least d^ml''^ 
great triangles. 

Let ^2 = 462^ + 3(5i, ^3 = 1262 + 35i, 5^ = 114e2/^ + 4^2 + 4^3, and 
7 = 2(^4/do (= G(vo/do))- We will use these ugly expressions in the state- 
ment and proof of the next lemma, but will then immediately pass to the 
relaxed version. Corollary 14.61 at which point 82-, 5^, 5^ will disappear from 
the discussion. 

Lemma 4.5. (a) At least (1 — 52)ra vertices of A are good. 

(b) If a is good, then \{b G B{a) : b is not nice to a}\ < 5^ml~'^ ; thus at 
least (1 — 2e2^ — 5'i)ml~^ vertices of B{a) are nice to a. 

(c) At most 54,nrfil~^ members of T{P) are not good. It follows that T{P) 
contains at least (1 — 7^2 — 6/i)m^l^^ good triangles and at least {d^^ — 7s2l — 
(54)?TT,^/~^ great triangles. 

(d) At least (1 — j)m vertices of A are great, and at least (1 — 'y){m'^l^^) 
edges of Pab are great. 

Corollary 4.6. (a) At least (1 — 7)771 vertices of A are good. 

(b) If a is good, then \{b G B{a) : b is not nice to a}\ < ^ml"^ , and at least 
(1 — 7)777/"-^ vertices of B{a) are nice to a. 

(c) At most jm^l~'^ members of T{P) are not good. At least (1 — 7)777'^/^^ 
triangles ofT(P) are good, and at least {d-,^ — 7)777^/"^ are great. 

(d) (Repeating:) At least (1 — 7)777 vertices of A are great and at least (1 — 
^)m'^l~^ edges of Pab are great. 

Proof of Lemma \A.5[ We use "bad" for "not good" and for the proofs of (a) 
and (b) set G = Pbc- 
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(a) By Proposition 14.11 at most 4e2m vertices of A are indecent; so failure 
of (a) implies that there is a set ^o of at least {3/2)5im decent vertices of 
A satisfying either (i) for each a G Aq there are -Bi(a), . . . ,Br{a) C B{a) 
and Ci(a), . . . , Cr{a) C C(o) such that, with Gs{a) = Bs{a)Cs{a), we have 
|U^=iG,(a)| >(5im2/-3, and 

|L-(a) n {uUGs{a))\ < K " '^)l U^i Gs{a)\, 

or (ii) the corresponding statement with "< (d^ — 5)" replaced by "> (cItt + 
(5)." Assuming the first (the argument for the second is identical) and setting 
Gs = UaeAoGs{a), Hs = yJa&Ao{ab : b £ Bs{a)} and K^ = UaeAoiac : c G 
Cs{a)}, we find that for the subtriads Qs = {Gs,Hs, Kg) of P we have 

I ULi T(Q,)| = ^ I U^^^i Gs{a)\ > \Ao\5i7n'l-^ > 6t{P) 

a€Ao 

(using the upper bound on t{P) in Proposition 14. 4p . while 

|uLiT.(Q.)| = 5^|L-(a)n(uLiG.(a))| 

aeAo 

< ^{d^- S)\ U^=i Gs(,a)\ = {d^ - 6)\ ULi nQs)\, 

a€Ao 
contradicting the (5, r, 7r)-regularity of P. 

(b) Since a is decent, each of |i?(a)|, |C(a)| is at least (1 — 2£2l)ml^^; in 
particular the second assertion in (b) follows from the first. By Proposition 
14.21 |{^ G B{a) : ab is indecent} | < 12e2"i; so we will be done if we show 
that at most 'i5iml~^ 6's violate (j25]) . Suppose instead (e.g., the other case 
again being similar) that there is Bq C B{a) of size at least (3/2)(5im/^^ 
with 

|L^(a6)| < {d^ - 2S)ml~^ V6 G -Bo- 
Then with Gi = BQG{a) we have 

|Gi| > \Bo\\Cia)\{l~^ - 2^2) > \Bo\{l - 2e2lfml-^ > 5im^l-\ 

while 

|L^(a) n Gil = ^ \L''{ab)\ < \Bo\{d^ - 25)ml-^ < {d^ - 6)\Gi\, 
be Bo 

contradicting the assumption that a is good. 
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(c) Of the triangles {a,b,c} of T{P) at most llAe2m^ are indecent (by 
Proposition 14. 2t the constant is of course a bit excessive); at most 362(1 + 
2e2l)^rn^l~^ < 'i62'm^l~^ are bad because at least one of a, b, c is decent 
but bad (by (a) and Proposition I4.3l fa)): and at most 3m{S3ml~^){l + 
2e2l)'^ml^'^ < 4:5sm^l~'^ are decent but bad because one of a,b,c fails to 
be nice to another (by (b) and Proposition I4.3r b)). This gives the first as- 
sertion; the second and third then follow from Proposition 14.41 the latter 
since the number of great triangles of P is at least 

\T^\ - 64771^1-^ = d^t{P) - 5im^l-^ > K - 7^2^ - 54\m^l-^. 

(d) Set rj = 7£2l + 64; thus (c) says that the number of great triangles is at 
least (dvr — r])m?l~^. 

We first consider great vertices a. A good a belongs to at most (fijr + 
25)vn?l~'^ great triangles (namely, \L'"{a)\ < {d.„ + S)\{B{a)C{a))\ < {(!■„ + 
5){l~^ + 2e2Yrn'^ < {d-,^ + 25)m^l~^). Thus, with s the number of non-great 
a's (note a bad vertex is in no great triangles), the number of great triangles 
is at most 

(m - s){d^ + 25)n?r^ + sdorn^r^, 

and combining these bounds gives (using ([HI)) s < {25+r])/{dTr+25—do)7n < 
'jm. 

The argument for edges is similar. A good edge belongs to at most 
(c^TT + 25)ml^^ great triangles, so if s is the number of non-great a6's then 
the number of great triangles is at most ((1 + e2l)m'^l~^ — s){d.n- + 26) + 
sdoml~^. Again combining with (c) bounds s by roughly {6i/dQ)Tn^l~^, and 
the (second) statement in (c) follows since \AB\ > (1 — £2l)m?l~^ ■ 



4.3 More basics 

We continue to work with P = Pabc-, and a fixed vr on P. For the next 
lemma we add a fourth block, say D, which only appears incognito: "de- 
cency" in Lemma |4 . 71 means with respect to A,B,C,D. 

Lemma 4.7. For T C T{P) with \T{P) \T\ < bjm^l'^ and h such that 
h^£2l'^ « d-jr, there are distinct ai,bij and Cij, i,j G [h] satisfying 

(i) {cj, bij,Cij} £ T is great for all i, j, and (ii) any set of four of the vertices 

ai , bij , Cij is decent. 
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In practice T will consist of all members of T{P) avoiding some set of 
pathologies that are known to be rare by the results of Section 14. 2i 



Proof. We first observe that, with a = d-,^ and T* the set of great triples 
from T, we have (using Proposition 14.41 and Corollarv 14.6( 0)) 

\T*\ > at{P) - 5-fm^l-^ - jm^l-^ > (a - 67 - 7e2l^)m^l~^. (26) 

Say an edge ab is fine if \{c : abc G T*}\ > ^aml~^, and a is fine if ab is 
fine for at least ^ml~^ 6's. We assert that 

at most 40{-y/a)m a's are not fine. (27) 

Proof of ()27p . Writing s for the number of non-fine a6's we find (with 
explanations to follow) that \T*\ is at most 

3-fm^r^ + ((1 + 2e2l)m'^l'^ -s){a + 25)ml-'^ + (l/2)sam/~2. (28) 

Here the first term covers triangles on edges ab that are either indecent 
or for which \L'^[ab)\ > (a + 26)ml~'^. (By Proposition 14.21 there are at 
most 0{e2m^) a6's of the first type, a minor term since £2 is much smaller 
than 7/"^. On the other hand, ab decent with |L'^(a6)| > (a + 25)ml~'^ 
implies that either a is bad, or a is good and b is not nice to a; by Corollary 
14.6( a) and (b), there are essentially at most 2'ym?l~^ such a6's; decency 
gives 1X^^(06)1 < (1 + 2e2l)'^ml~'^ .) The expression (1 + 2e2l)m?l~^ is an 
upper bound on the number of decent edges ab, and the rest of (p8|) is 
self-explanatory. 

Combining (|28p and ()26p gives (say) s < 19{'y /a)m^l^^. It follows (using 
I^abI > (1 ~ 2s2l)m'^l~^) that for the number, say u, of fine a6's, we have 

n > (1 - 197/a)m2/"i. (29) 

But we also have, with v the number of non-fine a's, 

u < 282771^ + {m-v){l + 2e2l)mr^ + {l/2)vmr^ < {m - v / 2)ml~^ + AE2n? , 

and combining this with (j29|) gives (f27|) . 

We now turn to producing the sequences described in the lemma. First, 
from the set of at least (1 — 407/a)m fine a's, choose (distinct) ai,. . . ,0^ 
such that 

any 4-subset of the a^'s is decent. (30) 
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This is possible because, by Proposition 14.21 once we have ai, . . . , aj, (pOj) 
rules out at most 0{i'^£2Tn) choices for aj+i. 

Second, for z = 1, . . . , /i, do: for j = 1, . . . , /i choose (distinct) bij,Cij with 
o-ihijCij E T* such that (ii) holds for all a's, 6's and c's chosen to this point 
(that is, any set of at most four vertices from {ai, . . . , a^} U IJ{{^fc«) ^ki} '■ 
k < i or [k = i and I < j]} is decent). We can do this because (again using 
Proposition l4.2p when we come to j: from an initial set of at least (l/2)7Ti/~^ 
6's for which a-ib is fine, at most 0{h^S2Tn) are disallowed because they 
introduce a violation of (ii) or are equal to some earlier b^i', and similarly, 
given bij, there are at least (l/2)aml~'^ — 0(h^e2'm) choices for Cij. 



In Section [5] we will use sequences as in Lemma 14.71 to prove the im- 
possibility of certain combinations of patterns. The underlying mechanism, 
provided by Lemma 14.91 is again similar to uses of ((5, r)-regularity in [8]. 
We first need the elementary 

Proposition 4.8. If Si, . . . , S^ are sets of size at least p with {SiCi Sj\ < 
q '^i ^ j, then for any k < h we have 

|U5,| >\U^^,Si\>kp-{';)q. 

In particular, if h > p/q then taking k = p/q gives | U ^jI > p^/(2g). 

Lemma 4.9. (a) Suppose Xi C A and Yi C B, i = l,...,h with h > 
{\/Kfr+'^-''-^ satisfy 

\Xi\ > Xmr", \Yi\ > Xmr'' V (31) 

and 

\Xi nXj\< Kml-", \Yi n Yj\ < Kml-^ ^i + h (32) 

where A > £2 n^axj/",/ }. Then 

I u XiYi\ > ^m2/^+^-2»-2fe-i. (33) 

(b) If Xi Q A, Y, Q B and Zi C C, i = l,...,h > {\/ nff+^+f-"-^-" 
satisfy 

\X,\ > Xml-", \Yi\ > \ml-^, \Zi\ > Xmr" VVi 

and 

\Xir\Xj\ < Kml"'^, \Yir\Yj\ < KmV, \Zir\ Zj\ < nmr^ \ji^2, 
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where (say) X > 40e2 max{/'',/^^} and k > (20e2^'^+''+^)^/^ then 

I U XiYiZil > ^m3z^+-+/-2'^-2fc-2c-3. (34) 

Remark. The assumptions on A and k, as well as the precise expressions 
involving them in (I33p and (I34p . are best ignored. In practice both will be 
large compared to l~^ (a fortiori to 62), so that the assumptions will be au- 
tomatic and their roles in the conclusions minor. In some of our applications 
we could improve the constants in these conclusions by using, e.g., different 
A's in the two bounds of (l3T]). 



Proof of (a). We have (by (f3T]l and e2-i'egularity) 

\XiY^\>{l-2e2l)X^m^r''-''-^ Vi (35) 

and 

\X,^^nXjYj\ = \{XinXj){Y^r\Yj)\ < (i + 2e2i)K^m^r''-'^-^ Vi / j, (36) 



where the second inequality follows from e2-i'egularity and (I32p when each 
of \Xi n Xj\, \Yi nYj\ is at least £2m, and from \{Xi n Xj){Yi n Yj)\ < £2m? 
otherwise. Combining these and applying Proposition 14.81 (and sacrificing a 
factor like 3/2 to take care of the terms with £2Vs) gives (f33]l . 

The proof of (b) is similar and we won't repeat the argument. Here 
the lower bound on \XiYiZi\ corresponding to ([35]) and the upper bound on 
\XiYiZi n XjYjZj\ corresponding to (j36|) are given by Proposition 14.41 I 

5 Configurations 

We continue to follow the conventions given at the beginning of Section [H 
We will use (for example) 





A 


B 


c 


D 


TT 


O-A 


o-fi 


0-c 


- 


vr' 


TA 


TB 


- 


TD 



to mean that tt and vr' are patterns on Pabc and Pabd respectively, with 
'k{A) = GA (s {0, 1}) and so on. A combination of patterns — called a 
configuration and usually involving more than two patterns — is legal if it 
can arise in a feasible V* . 

Two configurations are isomorphic if they can be obtained from each 
other by interchanging rows, interchanging columns, and/or interchanging 
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O's and I's within a column (so by renaming blocks or patterns, or by in- 
terchanging the roles of positive and negative literals within a block). Of 
course legality is an isomorphism invariant. 

This long section is devoted to showing illegality of certain configurations 
in a feasible V* . To use the feasibility assumption we will (of course) fix 
some 'la ^ V* and then, as usual, our notation (e.g. L'^ , T^^, witnesses) refers 
to ^. We will make repeated use of Lemmas 14.71 and 14. 9^ always with h = r 
(= /^), A = do, and k « 1. Usefulness of the bounds (p3]) and ([M|) then 
requires several lower bounds on do, the strongest of which is 

dg > 10(5. (37) 

Most of our configurations will involve four blocks, but we begin with a 
pair of patterns using just three, say A, B, C, and abbreviate Pabc = P- 

Lemma 5.1. Any two patterns for P differ on at most one of A,B,C. 

Corollary 5.2. There are at most two patterns on P. 

Proof of Lemma 15.11 

Suppose instead that the patterns tti and 1^2 differ on at least two of A, B 
and C, say (w.l.o.g.) 7ri(^) = 7ri(B) = 7ri(C) = 1 and ■K2{B) = 7r2(C) = 0. 
There are then two cases: 



Case 1 


A 


B 


c 


VTi 


1 


1 


1 


vr2 


1 









Case 2 


A 


B 


C 


TTl 


1 


1 


1 


712 












Case 1. According to Lemma 14.71 we can find a (g A) and disjoint pairs 
{bi,Ci) {€ B X C) for i £ [r] satisfying: 

(i) each {a,6j,Cj} is great for tti; 

(ii) a is good for tt2; 

(iii) each set of three of the vertices a, 6j, Cj is decent. 

To see this, let T in Lemma 14.71 consist of those {a,b,c} G T{P) for which 
a is good for tt2. Then Proposition 14.21 (with s = 0), Corollary 14.6( a) and 
Proposition I4.3r a) give 

\T{P) \T\< 0{e2m^) + 7m(l + 2e2lfrr?r^ < bjm^r^. 

(Of course Lemma 14.71 gives more than what we use here.) 
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Let Wj be a witness for 7ri(a, 6j,Cj) (= abiCi) and set 

B, = L^^iaci) \ {h}, a = L^Habi) \ {q}. 
Then for each i we have 

\B,\,\Ci\ > domr'^ (38) 

(since by (i) and the definition of "pattern of "P*" we have \Bi\, \Ci\ > {d.,^^ — 
26)ml~'^ — 1 > doml~'^) and, by the definition of "witness," 

Bi,CiC^Nr\0) 

which implies that 

L^'^a) n Bid = $ (39) 

(since be G L'^^{a) D BiCi would mean that Wj satisfies the clause abc E 'i^, 
contradicting the assumption that Wj is a witness for abiCi). On the other 
hand (iii) says \Bir\Bj\, |CjnCj| < {l + 2£2lYml~^ (V? / j), so that, in view 



of dMD and dSI]), Lemma IMl^a) gives | U AQI > \dt{l + 2e2l)~^m'^l-^ > 



Si'wP'l ^. But then ([39|) contradicts (ii). 

Case 2. By Lemma 14.71 (with 7" = T{P)) we can find triples {aj,&j,Ci}, 
i G [r], satisfying: 

(i) each {aj,6j,Cj} is great for tti; 

(ii) each set of four of the vertices Oj, 6j, q is decent. 
Let Wj be a witness for 7ri(aj, 6j,Cj) (= aibiCi) and set 
Ai = L^^{biCi)\{ai}, B, = L^^{aiCi)\{bi}, d = L''' (aA) \ {ci} . 

Then for each i we have 

\Ai\,\Bi\,\Ci\>doml-^ (40) 

and 

Ai,Bi,aQ^Nr\0), 

the latter implying 

T^, n AiBid = 0. (41) 

On the other hand Lemma [4.9( b) with ()40p and (ii) (which implies that each 

of \Ai n Aj\, \Bi n Bj\, Id n Cj| is at most (1 + 2e2lYml-^) gives 

I U AiB,d\ > ^cl[j(l + 2e2l)-^^m^r^ > 6t{P) 

(where the second inequality uses ([37|) and the upper bound in Proposition 
so that (I4ip contradicts the assumption that 7r2 is a pattern. 
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We now turn to configurations on four blocks, say A,B,C,D. At one 
point in the argument we will need the next result, which is contained in 
Lemma 4.2 of [8] (the "Counting Lemma"). 

Lemma 5.3. Let tti, 112, 113 and tt^ he patterns on PabC; Pabd, Pacd and 
Pbcd, respectively. Then for any ^ ~ P* there are a£A, b£B,c£C 
and d £ D so that 7ri(a,6, c), TT2{a,b,d), 7r3(a, c, d) and 7r4(6, c, d) are all 
clauses ofia. 

glossaryname=consistent Say a configuration is consistent if any two 
of its patterns agree on their common blocks. Our main technical result 
is Lemma 15.51 which in particular says that, up to isomorphism, the only 
inconsistent legal configuration comprised of patterns on three distinct triads 
from a given set of four blocks is 



ConfO 


A 


B 


C 


D 


TTi 


1 


1 


1 


- 


71-2 


1 


1 


- 


1 


vrs 





- 


1 


1 



(To elaborate a little: any configuration of the type described is isomorphic 



to some 





A 


B 


C 


D 


TTi 


1 


1 


1 


- 


71-2 


1 


* 


- 


1 


VTS 


* 


- 


* 


* 



(where the *'s are O's and I's); and then either the *'s are all I's (and we 
have coherence) , or the configuration is isomorphic to Configuration above 
or to one of the first eight configurations of Lemma 15.51 the only slightly 
nonobvious case here being the isomorphism 





A 


B 


c 


D 


TTl 


1 


1 


1 


- 


vr2 


1 





- 


1 


vrs 





- 





1 



Conf 4 


A 


B 


c 


D 


VTi 


1 


1 


1 


- 


71-2 


1 





- 


1 


vrs 





- 


1 






gotten by interchanging the first two rows, the last two columns, and the 
and 1 in the second column.) A convenient rephrasing of the above assertion 
regarding Configuration (which, again, will follow from Lemma l5.5p is 
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Corollary 5.4. In a legal configuration consisting of patterns on three dif- 
ferent triples from a set of four blocks, no column can contain a 0, a 1 and 
a blank. 

Lemma 5.5. The following configurations are illegal. 



Conf 1 
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VTi 
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vr2 
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VTS 


1 


- 









Conf 2 
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- 
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- 
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VTS 





- 









Conf 6 
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TTi 


1 


1 


1 


- 


7r2 
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1 
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1 






Conf 7 
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TTl 
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vr2 
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- 
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VTS 
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1 



Conf 8 
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TTl 
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vr2 
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1 
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VTS 
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1 






Conf 9 
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C 
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111 
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1 


1 
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71-2 
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1 





- 


TTS 








- 


1 



Con/ 10 


^ 


B 


C D 


TTl 
71-2 


1 
1 
1 


1 
1 



1 - 
- 
- 1 



Remarks. The full list of forbidden configurations in Lemma 15.51 is slightly 
more than what we'll eventually need, but it seems worth recording precisely 
what's going on here. Though the arguments are fairly repetitive — and 
we will accordingly give less detail in the later ones — we don't see a way 
to consolidate. An outlier is Configuration 8, which is easily handled by 
Lemma 15.31 but doesn't seem susceptible to an argument like those for the 
other cases. 

Proof of Lemma 15.51 Excepting those for Configurations 7 and 8, each of 
the following arguments begins with a set of variables satisfying certain 
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properties, with existence again given by Lemma l4.7i We only discuss this 
for Configurations 1 and 6 (see also Case 1 of Lemma fS.ip . arguments in the 
remaining cases being similar to (usually easier than) that for Configuration 
1. Note that, without further mention, we assume in each case that the 
specified variables are distinct. 

Configuration 1. Let a, bi, . . . ,br and c satisfy: 

(i) each {a,bi,c} is great for vri; 

(ii) each {a, 6j} is good for 7r2; 

(iii) a is good for vrs; 

(iv) each set of three of the vertices a, bi, c is decent 

(Existence: Take T in Lemma [4. 71 to consist of all {a, 6, c} G T{P) for which 
ab and a are good for 7r2 and vra respectively. Corollary I4.6l fa.b) bounds the 
number of a's that are bad for 7r2 or tt^ by 27m; the number of 6's that are 
bad for tt2 by 7m; and the number of {a, 6}'s with a, b good for tt2 but ab 
bad for 7r2 by ^m?'l~^ . Thus Propositions 14.2 1 and 14.3] give 

which is less than 57771,'^/"^.) 

Let Wj be a witness for 7ri(a, 6i,c) (= abic) and set 

d = L^i (abi) \ {c} and A = ^^' (abi) 

Then Ci,Di C w~ (0), implying 

L^3(a)naA = 0- (42) 

On the other hand, 

\Ci\, \Di\ > doml^'^ 

(given by (i) and (ii)) and (iv) (which bounds each of |Ci n Cj|, |Di n Dj\ by 
(1+2620^"'-^"^ foi" * / j) imply (using Lemma [49]l that |UCiA| > dirn^l^'^. 
But then (j42p contradicts (iii). 

Configuration 2. Choose triples {ai,bi,Ci}, i £ [r], satisfying: 

(i) each {aj,6j,Cj} is great for tti; 

(ii) each {aj,6j} is good for 7r2; 

(iii) each set of three of the vertices ai,bi,Ci is decent. 
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Let Wj be a witness for 7ri(aj, 6j,Cj) (= aibiCi) and set 

^, = L^H^.c,) \ {fli}, Ci = L''^{aA)\{ci}, D, = L^^{aibi). 
Then for each i we have 

|^i|,|Ci|,|A| >dom/~2 
(by (i) and (ii)) and 

The latter imphes 

r^3 n AiCDi = 0, (43) 

while the former, with (iii) and Lemma 14.91 (using ()37p and Proposition 14.41 
as in Case 2 of Lemma l5.ip gives | U AiCiDi\ > 5m'^l~^, and these together 
contradict the assumption that tt^ is a pattern. 

Configuration 3. Choose a and pairs {bi,Ci}, i G [r], satisfying: 
(i) each {a,6j,Cj} is great for vri; 
(ii) a is good for 7r2 and vrs; 

(iii) each set of three of the vertices a, hi, Ci is decent. 
Let Wj be a witness for 7ri(a, 6i,Ci) (= ahiCi) and set 

B, = L^^iaci) \ {bi}, Ci = L^'iabi) \ {a} 

and 

Dl = ^NT\T)r\D{a), TG{0,1}. 

Then for each i we have (by (i)) 

|Si|,|Ci| >dom/^2 ^^^ B^,CiCwr^{0). 

W.l.o.g. there are at least h/2 i's — say those in / — for which \Dj\ > lml~^, 
so that Lemma US] (with (iii), and just using \Dj\ > dQml~^) gives 

\UBiD}\ > \\JieiBiDl\ > 5irr?r^. 

But we also have 

L'^\a)r\BiD} =0, 

so we contradict the assumption that a is good for 7r2. 
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Configuration 4: Let {aij,Ci,dij}, i,j £ [r], satisfy 

(i) each {aij,Ci,dij} is great for ir^; 

(ii) each Cj is good for vri; 

(iii) each set of four of the a^j's, Cj's and dij's is decent. 

Let Wjj be a witness for T:^{aij,Ci,dij) (= dijOidij) and set 

Aij = ^^{cidij) \ {aij}, D,j = L^'^aijCi) \ {dij} 

and 

Br,.=w-.i(r)nS(Q), TG{0,1}. 

Then for all i,j we have 

l^ijMAil > dom/"2 a^]^(j Aij, Dij C wr.i(l), 

the latter implying in particular that 

L^'{ci)nA,B}^=(ll. (44) 

Suppose first that there is an i for which \Bj-\ > ^ml"^ for at least h/2 
j's, say those in J. Then combining our lower bounds on \Aij\ and {BfA 
with (iii) and applying Lemma 14.91 gives 

\Uj^jAijBlj\ >6im^r^. 

But then ([H|) contradicts the assumption that q is good for tti. 

We may thus suppose (at least) that for each i there is some j{i) with 
l-^i* Yi")l ^ \rnl~^ . We then drop the remaining j's and relabel Oj = o,ijfi\, 
di = dij{i), Wj = Wjj(j), Aj = ^ij(j), A = A,j{i) and A = -B°^(j)- 

Since Ai,Di C w^ (1) and A C w^ (0) we have 

r^2 n (uAiAA) = Vi. (45) 

But our lower bounds on sizes (to repeat, these are \Ai\, | A| > dom/~^ and 
\Bi\ > gm/""*^) together with (iii) imply {via Lemma l4.9t note that here the 
\Ai n ^jl's and | A H Dj^s are all at most about ml~'^) 



U AjBjAl > 5'm'^l 



3;-3 



SO that (jl5]) contradicts the assumption that 7r2 is a pattern. 
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Configuration 5: Let {ai,bij,Cij}, i,j £ [r], satisfy 

(i) each {aj,6jj,Cij} is great for tti; 

(ii) each Oi is good for 7r2; 

(iii) each set of four of the Oj's, 6jj's and Cjj's is decent. 

Let Wjj be a witness for Tri{ai,bij,Cij) (= aibijCij) and set 

Ay = L''! {bij,Cij) \ {ai}, Bij = V^ {oiCij) \ {bij}, Qj = V^ (oibij) \ {cy} 

and 

Dlj=wT^HT)nD{a,), TG{0,1}. 

Then 

l^ijl, \Bij\, \Cij\ > doml~ 

and 

implying in particular that 

L-2(a,)n(U,-A,I?4-)=0- (46) 

If there is an i such that |-Dj^,| > hml~^ for at least h/2 j's, then Lemma 
14.91 (with (iii) and our lower bound on \Bij\) gives 

\Llj BijDJjl > 5irr?l~^, 

so that (1461) contradicts (ii). 



We may thus suppose that for each i there is some j{i) with I-D^^cjJ > 
im/~\ and relabel Wj = Wjj(j), Aj = Aij(j), Ci = Ci^i^) and A = -^j°j(j)- 
Then Ai,Ci, Di C w~ (0) implies 

r,3 n (u^iQA) = Vi, 

while Lemma 14.91 gives 

I UAidDil > 6m^r^, 

contradicting the assumption that tt-^ is a pattern. 
Configuration 6: Let c and the pairs {aj,6j}, z G [r], satisfy 
(i) {aj,6j,c} is great for tti; 
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(ii) \L''^{aibi)nD{c)\ > dQml~^; 

(iii) c is good for tt^; 

(iv) each set of three of the vertices Oj, bi, c is decent. 

(For existence we use Lemma 14.71 with T consisting of ah {a, b, c} G T(P) 
for which \L'^^{ab) n D{c)\ > dQml~^ and c is good for ir^. (In showing T is 
large we restrict to ab's that are good for 7r2, but this is not needed once we 
have existence.) 

The number of {a, b, c}'s with ab bad for 7r2 or c bad for tts is bounded, 
as in the argument for Configuration 1, by 57771^/"^. On the other hand, 
if ab is good for 7r2, then e2-regularity (of Pen) gives \L'^'^{ab) Ci D{c)\ > 
{d-n-^ — 2(5) (1 — e2l)ml~'^ > doml~^ for aU but at most e2m c's.) 

Let Wj be a witness for ■Ki{ai,bi,c) (= aibic) and set 

Ai = L^^{b^c)\{a^} and Di = L^^iaA) D 0(0). 
Then Ai, Di C w^" (0) implies 

L"3(c)nAiA = 0; (47) 

but 

|Aj| > dQiril^'^ and |-Dj| > dornl^^ 

(given by (i) and (ii)) and (iv) imply (using Lemma IT9l and (iv); note here 
iDiHDjl is at most about ml^^) \VJAiDi\ > 6im'^l~'^, so that (I47p contradicts 
(iii). 

Configuration 7. For a pattern n on Pabc^ ^^Y c is good for tt relative to d 
if L'^{c) n A'B' / whenever A' C A(c, d) and S' C B(c, d) are each of size 
at least dom/~^; of course "d good for vr' relative to c" for a pattern tt' on 
Pabd is defined similarly. 

To rule out Configuration 7 it will be enough to show that there is some 
{a, c, d} that is great for vra and satisfies 

(i) c is good for vri relative to d; 

(ii) d is good for 1^2 relative to c; 

(iii) {a,c,d} is decent. 

Given such a triple, choose a witness w for 7r3(a, c, d) and set 

A' = L-3(c,ci)\{a} (Cw-i(l)) 
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and 

B^ = w-^{T)nB{c,d), re {0,1}. 

We then have \A'\ > doml'^ (since cd is good for ir^) and, w.l.o.g., \B^\ > 
i(l - 2e2lf-ml~'^, contradicting (i) (since L'^'-{c) n A'B^ = 0). 

For existence of a, c, d as above, we may argue as fohows. We know from 
Corollary 14.6( c) that at least Aom^l'^ triangles {a,c,d} are great for tts, 
so just need to show that the number that fail to satisfy (i)-(iii) is smaller 
than this. The number that violate (iii) is (by Proposition 14.21 as usual) 
0{£2'^)- We will bound the number of violations of (i), and of course the 
same bound applies to (ii). 

By Corollarv 14.6( a) at most ^m c's are not good for tti. On the other 
hand, we assert that if c is good for tti then the size oi D' := {d & D{c) : 
c is not good for vri relative to d} is 0{he2m). For suppose this is false and 
choose di,...,dh G D' with all triples {c,di,dj} decent. (For existence of 
the dj's just note that, as in Lemma 14.71 the number of d's that cannot 
be dj+i is at most 0{i£2m); of course this is where we use the assumption 
that D' is large.) For each i E [r] let Ai C A{c,di) and Bi C B{c,di) be 
sets of size at least doml~'^ with L'^i(c) n AiBi = 0; then Lemma 14.91 (using 
decency to guarantee that the \Ai n ^j|'s and \Bi n -BjI's are small) gives 
I U AiBi\ > 5im?l~'^, so that L'"'^{c) n UAiBi = says that in fact c was 
not good for tti (so we have our assertion). Thus the number of triangles 
{a, c, d} for which {c, d} is decent but violates (i) is at most 

[7m(l + 2£2l)mr^ + 0{he2)m\l + 2£2lfmr'^ < ijm^r^. 



Configuration 8. As mentioned earlier, this one doesn't seem to follow from 
an argument like those above, but is an easy consequence of Lemma 15.31 
according to which there are a,b,c,d such that each of TTi{a,b,c) = abc, 
7r2(a, 6, d) = abd and 7r3(a,c, d) = acd belongs to '^. But this is impossible, 
since a witness w for abc must satisfy either abd (if w{d) = 1) or acd (if 
\N{d) =0). 

Configuration 9. Choose d and {aj,6j}, i G [r] satisfying 

(i) each {ai,bi,d} is great for 713; 

(ii) each set of four of the vertices Oj, bi, d is decent. 

Let Wj be a witness for 713(0^, 6j,(i) (= djbid) and set 

A, = L'^'ihd) \ {ai}, Bi = L^-'{a,d) \ {b,} 
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and 

Cl = ^N-\T)nC{ai,h), TG{0,1}. 

W.l.o.g. \Cl\ > Iml-"^ for at least h/2 i's. But then Ai,Bi C wri(l) and 
l^il, \Bi\ > dom/"^ imply | U AiBiC}\ > 6m?l~^ , so that 

T^, n u^,AC/ = 

contradicts the assumption that tti is a pattern. 
Configuration 10. Choose a and {6,,(ii}, z G [r], satisfying 
(i) each {a,hi,d.i} is great for vra; 
(ii) a is good for vri and 7r2; 

(iii) each set of four of the vertices a, 6j, dj is decent. 
Let Wj be a witness for T:^{a,hi,di) (= abidi) and set 

B, = L^^(a,(i,)\{6a 

and 

C^ = w-i(T)nC(a), TG{0,1}. 

w.l.o.g. |C/| > gm/-^ for at least h/2 i's. But then Bi C wri(l) and 
\Bi\ > doml'"^ give | U BiC}\ > 5im?l~^ and 

L^Ha) n (U^iC/) = 0, 
contradicting (ii). 



6 Coherence 

Here we complete the proof of Lemma 13.11 We continue to work with a 
fixed feasible V* (so that "triad" and so on continue to mean "of P*" unless 
otherwise specified) . As usual in applications of regularity, we will eventually 
have to say that we can more or less ignore some minor effects, here those 
associated with clauses not belonging to patterns oiV*; but we delay dealing 
with this for as long as possible (until we come to "Proof of Lemma 13.11 ' 
below). 
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In addition to the "auxiliary" parameters Ci and ci mentioned earlier 
(at the end of Section [3|) we use (p = .05, chosen to satisfy 

^<{l-H{l/Z))/2 (48) 

and 

(^<min{10-a-blog3: a, be N,a + blog3< 10}. (49) 

We then require 

Ci « Cl, (50) 

meaning C,i < eCl for a suitable small e which we will not specify; 

10C1V9-1 < (Ci/6)2; (5-^) 

and 

Ci > 2c2cr^ (52) 

(Given ^2 we may successively choose d, ci, C2 small enough to achieve (j50|) . 
([5T]) and ([52]) respectively.) 



Define a bundle configuration (BC) of V* to be any /3 = (/3jj : {i,j} G 
(^1)) G H*-^). Similarly, for / C [t], an I-bundk is some /3 = (/3jj : {i,j} G 
(2)) G m^^^ ■ In this case we call the blocks indexed by / the blocks of (3; say 
/? is a k-bundle if |/| = k; and for J C / set /3[J] = {(3ij : {i,j} G (2)) — a 
subbundle or \J\-subbundle of /3. In any case we call the -Pg'.''. 's (i, j in [t], / 
or J as appropriate) the bundles of (3 (or, in the last case, /3[J]). Of course 
those for which {i,j} violates ([8]) or Pl'^^ is not e2-i'egular are essentially 
irrelevant; but they are useful for bookkeeping purposes. 

The next few definitions parallel the discussion leading to Lemma 13. 1[ 
The patterns and clauses of a BC or fc-bundle (3 are those patterns and 
clauses of V* that are supported on (bundles of) /3. We use JC{/3) for the set 
of clauses of f3 (so the set of members of K.{V*) supported on /3), M{f3) = 
{^n/C(/3) :'^~P*} and A^(/3) = |AA(/3)|. 

In contrast we will take a triad of /3 to be any triad of V (the partition 
underlying V*) supported on /?. But note that as soon as a triad supports a 
pattern it is necessarily a triad of "P*; in particular a proper triad of /3 will 
be a proper triad of V* supported on /3. 

It will now also be helpful to define 

/i(/3) = [(1 + he2l^)mH-'^\-^ log iV(/3) (53) 
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and h(V*) = [{1 + 5e2l^)Tn'^l^'^]^^ logN{V*), the expression in square brack- 
ets being the upper bound on t{P) given by Proposition 14.41 (for any triad 
P of V*). This is a convenient normalization: for a pattern vr of V* , say on 
triad P, the number of possibihties for the restriction of a "^ ~ P* to tt is 

(dim) < exp[i?(d.)i(P)] (54) 

(recall H is binary entropy), so that the aforementioned upper bound gives 

HP) < "^{HidTr) ■■ vr a pattern of /?}. 

For P a given /-bundle, J '^ I, and A, . . . , Z the blocks indexed by J, we 
will also write h{A, . . . , Z) for /i(/3[J]). 

For a fixed k, say a /c-bundle /? is coherent if there is /^ : {blocks of /3} — t- 
{0, 1} such that each triad P of (3 agrees with fp (which, recall, includes the 
requirement that P be proper). The definition for coherence of a BC is 
defined is similar to that for an extended partition; precisely: a BC /3 is 
coherent if there is some f = fp '■ {blocks of V*} — )• {0, 1} such that 

all but at most Ci(3) triads of P agree with fp. (55) 

In outline the proof of Lemma l3. II goes as follows. First, the forbidden 
configuration results of Section [5] are used to prove 

Lemma 6.1. For a 4-bundle /3, any legal configuration consisting of one 
pattern on each of the four triads of (3 is consistent. 

(Recall consistency was defined (in the natural way) following the statement 
of Lemma 15.31 ) 

Using this and, again, the results of Section [Sj we obtain what we may 
think of as a "local" version of Lemma 13.11 viz. 

Lemma 6.2. A 5-bundle j3 with 

h{(3) > 10 - (/J (56) 

is coherent. 

Corollary 6.3. For any 5-bundle (5, h{l3) < 10. 

Remarks. Note that the analogues of Corollary 16.31 and Lemma 16.21 for 4- 
bundles (5 (namely that h{(3) is at most 4 and that /i(/3) close to 4 implies 
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coherence) are not true; rather, /i(/3) can be as large as 3 log 3, as shown 
by adding the pattern vrg = (1, 1, 0) on {B, C, D) to Configuration 11 in the 
proof of Lemma 16.21 below. It is for this reason that we need to work with 
5-bundles. 

For extension of the present results from 3 to larger k, it is getting to 
a suitable analogue of Lemma 16.21 that so far requires /c-specific treatment, 
though a general argument does not seem out of the question. Notice for 
example that for A; = 4, the "5" in Lemma 16.21 will become "7," since (com- 
pare the preceding paragraph) there can be 6-bundles /3 with h{(3) > 15 
(= (J). Here one should of course substitute [15] for [8], which does not 
seem to cause any difficulties. The rest of the argument (i.e. from Lemma 
16.21 onwards) seems to go through without much modification. 

Once we have Lemma [6.2l (and Corollary 16. 3p we are done with all that's 
come before, and may derive Lemma 13.11 from these last two results. A 
convenient intermediate step is 

Lemma 6.4. (a) For any BC /3, h{l3) < (*) . 

(b) Any BC f3 with 

/i(/3)>(l-ci)(*) (57) 

is coherent. 

Before turning to proofs we need some quick preliminaries. We first recall 
Shearer's Lemma [5], which we will need here and again in Section [8l For a 
set W,ACW andJ='C 2^, the trace of J" on A is Tr{T, A) = {FnA:F e 
T}. For a hypergraph Ti on W — that is, a collection (possibly with repeats) 
of subsets of W — we use, as usual, d-^(x) for the degree oi x £ W in T-L; 
that is, the number of members of Ti containing x. The original statement 
of Shearer's lemma (though his proof gives a more general entropy version) 
is 

Lemma 6.5. Let W be a set and T C 2'^, and let 71 be a hypergraph on 

W with d-}{{v) > k for each v G W . Then 



logm<i J]log|Tr(J-,.4)|. 
AeH 

Applications of Lemma 16.51 in the present section will be instances of 
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Corollary 6.6. (a) Suppose 3 < k < q; let I be a q-subset of [t] and j3 an 
I-bundle. Then 

h{P)<il-J^'"Y.{KP[J])--J^{[)}- 

(b) h{V*) < /"(2)+3^/j(^)^ yjhere the sum runs over EC's /3 (ofV*). 

Proof. For (a) apply Lemma [63] with W = /C(/3), J" = J\f{(3) and Ti = 
{ICif3[J]) ■■ J G (Di- Then Tr(^, /C(/3[J])) = Af{P[J]) and dn{C) = {IzD 
for each C G W, and the statement follows. 

The proof of (b) is similar and is omitted. I 

We will also make some use of the following easy (and presumably well- 
known) observation, whose proof we omit. 

Lemma 6.7. Any graph G with s vertices and at least (1 — a) (2) edges 
(where < a < 1/2) has a component of size at least (1 — a)s. 

Finally, we recall that (as in ()54p ). for any m and a G [0, 1/2], 



m 



am 



< exp[//(a)m]. 



Proof oi Lemma [6. II A counterexample would be a configuration of the form 





A 


B 


C 


D 


VTi 


* 


* 


* 


- 


71-2 


* 


* 


- 


* 


VTS 


* 


- 


* 


* 


7r4 


- 


* 


* 


* 



(where the *'s are O's or I's), in which we may assume (invoking isomor- 
phism) that each column contains at most one 0. Since the configuration 
is incoherent there is at least one 0, say (w.l.o.g.) 7ri(A) = 0. But then 
Corollary 15.41 says that the configuration consisting of vri , 7r2 and 7r4 is illegal 
(as is the full configuration). I 

Proof of Lemma 16.21 

Suppose A,B,C are blocks of /3, with P the corresponding triad of /3. 
Of course h{A, B, C) is zero if there is no pattern (of /?) on (^4, B, C), and 
at most 1 if there is exactly one such pattern. We assert that 

h{A,B,C)< logs (58) 
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in any case (really meaning when there are exactly two patterns on {A,B,C); 
see Corollary I5.2p . To see this, suppose (w.l.o.g.) vr = (1,1,1) and vr' = 
(1,1,0) are patterns on {A,B,C) and, for a fixed pair a,b (a G A,b £ 
B), consider the possibilities for the links L^{ab) = Ll^{ab) and L'^ (ab) = 
L7j,{ab) (with '(f ~ P*). We cannot have c G L^'iab) n L""' (ab) unless each 
of these links consists only of c (since e.g. a witness for abc' {d ^ c) would 
agree with one of abc, abc). Thus {L^{ab),L'^ (o-b)) is either a pair of disjoint 
subsets of C{a,b) (= Lp{ab)) or two copies of the same singleton, whence 
the number of possibilities for this pair is less than exp3[|C(a, 6)|] + |C(a, 6)|. 
This nearly gives (f58|) since ^ |C(a, 6)| = t{P); to keep the clean expression 
in (j58p (which of course is not really necessary), one may use the fact that 
'if ~ "P* requires that ^^^ \L'^{ab)\ = djrt(-P), but we leave this detail to the 
reader. (We could also get around this by slightly shrinking the coefficient 
oflogiV(/3) in dMD.) 

It follows, using Lemma 16.11 and Corollary 15. 2| that if A, B, C, D are 
blocks of /3, indexed by J say, with /i(/3[J]) > 3 + -^^(1/3) (> 2 + log 3), 
then either f3[J] is coherent or exactly three of its triads support patterns, 
and at least two of them support two patterns. It's also easy to see, using 
Corollary 15.41 that if we do have the latter possibility, say with two patterns 
on each of {A,B,D) and {A,C,D) and at least one on {B,C,D), then up 
to isomorphism (the set of patterns of) /3[J] contains the configuration 
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We next assert that if (3 is incoherent (and satisfies ([56]) ). then 

some 4-subbundle /?' of (3 is incoherent with h{f3') > 3 + i^(l/3), (59) 

so, according to the preceding discussion, contains Configuration 11. For 
the assertion, notice that incoherence of /3 implies incoherence of at least 
one of its 4-subbundles; so if ([59|) fails, then CoroUarv 16.61 (and the fact that 
h{/3') < 4 for a coherent 4-bundle /3') gives 

hil3) < 1[4 • 4 + 3 + H{l/3)] < 10 - 99, 

contradicting ([5B|) . 
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Assume then that (3 contains Configuration 11; let £^ be the fifth block of 
/3; and let a be the number of triads of /3 that support exactly one pattern, 
and b the number that support exactly two. Then 

h{p) <a + blog3, 

implying in particular (using ([56]) and (jl9|) ) that 

a + blog3>10. (60) 

Corollary 15.41 now says: (i) there is no pattern on {A,B,C} (since such 
a pattern together with (e.g.) tti and 7r4 would violate the corollary); (ii) 
there is either no pattern on {^4, B, E} or no pattern on either of {A, D, E}, 
{B,D, E} (since if vr is a pattern on {A, B, E} and it' a pattern on either of 
{A, D, E}, {B, D, E}, then vr and vr' together with one of vri, tt2 violate the 
corollary); and similarly (iii) there is either no pattern on {A,C,E} or no 
pattern on either of {A, D, E}, {C, D, E}. 

It follows that a + b < 7, which with (j60p implies b > 6, so that there 
is a set of four blocks from {A,B,C,D,E} three of whose triads support 
two patterns apiece (since if 5i, . . . , 5*6 are 3-subsets of a 5-set S, then some 
4-subset of S contains at least three S'j's). But we have already seen, in the 
derivation of Configuration 11, that any configuration consisting of five of 
these patterns must be isomorphic to Configuration 11, whence it follows 
easily that (up to isomorphism) (3 contains Configuration 11 together with 

A B C D 



vTe 



1 1 



The discussion in the preceding paragraph then shows that there is either 
no pattern on {B,C,E} or no pattern on either of {B,D,E}, {C,D,E}; 
and combining this with (i)-(iii) above gives a + b < 6, contradicting ()60p . 



Proof of Lemma 16.41 

(a) This is immediate from Corollaries I6.6r a) (with q = t, I = [t]) and[ 

(b) We first assert that (for (3 as in ([57|) ) 

all but at most lOci 99^^(5) 5-bundles of /3 are coherent. (61) 

Proof. By Lemma 16.21 the number of incoherent 5-bundles of /3 is at most 

.:=|{/G(M):/,(/3[/])<10-^}|. 
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Thus, again using Corollaries 16.6( a) and 16.31 we have 

M/3) < {\y\{Q-s)io + s{w-^)] = {'-y\wQ-^s], 

which, combined with ([57]) . gives s < Wciip^^[^). 

We may then finish via the following simple lemma. Let k,l he integers 
with k < I and W a set of size t. Suppose that for each i? E ( ^ ) we are 
given some aR : R ^ {0, 1}, and for i?, S" G ( ^ ) write R ^ S \i aji and as 
agree on R^^ S. Say L € ( ^ ) is consistent \i R ^ S M R,S ^ (^) . 

Lemma 6.8. For all k, I as above and e > there is a S, > such that (with 
notation as above) if at least (1 — 0{i) l-subsets of W are consistent, then 
there is some f : W ^ {0, 1} such that (Tr = /|/j for all but at most e(^) 
k-subsets R of W . 

We will prove this only for A; = 3 and / = 5, in which case we may take 
^ = (e/6)'^. The proof of the general case, an induction on k, is in a similar 
vein, though not exactly a generalization of the argument given here. 

Of course to get Lemma [6.4( b) from (the case /c = 3, / = 5 of) Lemma [6. 8 1 
we take W to be the set of blocks of V* , set ctr = np whenever P is a proper 
triad and R its set of blocks, and define 0"^^ arbitrarily for the remaining i?'s. 
(Here we use (fST]) .) I 

Proof of Lemma [618] (for k = 3,1 = 5). Let ^ be as above, set a = |v^, and 
say X ^ W is bad if there are at least 0(^)4 pairs {R, S} with: R,S^ ( 3 ) ; 
Rn S = {x}; and R ^ S. If the number of bad x's is b then the number of 
inconsistent 5-sets is at least j^boi{t)4, so we have b < ,^-, ^(g) < ^t. 

If, on the other hand, x is not bad then (by Lemma l6.7p there is f{x) G 
{0, 1} such that (Tr{x) = f{x) for at least (say) (1 — 8a) ( 2 ) 3-sets Rb x. 
So extending this / arbitrarily to the bad x's we find that the number of 
3-sets R that fail to satisfy aR = f\R is at most t ■ Sa^l^ ) ~^ ^{~2 ) ^ 
i8a + l)tt,')=e{i). 



Proof of Lemma \3.1[ We first show that clauses not belonging to K,{V*) are 
more or less irrelevant. We are interested in the number of possibilities for 
■r \ /C(P*) with "^ ~ P*. Members of 'T \ /C('P*) are either 

(i) clauses not supported on triads of V* or 
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(ii) clauses belonging to patterns vr that are supported on triads of V* , but 
that are not patterns of V* (i.e. for which 6-,^ < 2(io). 

The total number of possible clauses of the first type is 0{5 + £i + t~^)n^ = 
0{5n^) (see dSD), where the first term, given by ([TB]) . is for clauses supported 
on triads of the underlying partition V that are not triads of V* . (The 
other two terms bound the number of clauses that use either Vq or some 
Pq ^ or that meet some block more than once.) On the other hand, no 
'i^ ~ "P* contains more than 16^0(3) clauses of type (ii). Thus we have 
(using X;i<fc (T) < exp[i?(A;/m)m]) 

N*{V*) < exp[8F(2do)(3) + 0{5)n^]N{V*). (62) 

Thus (fT6]l implies 

h{V*) > [{l + 5e2lWl'']-'[{l-C2){';)-8H{2do){';)-0{6W] 

> (l-2c2)(*)/3 (63) 

(where we used C2 >> max{H{2do),6,e2l} (= H{2do)) and (3) > {l)m^)- 
We next observe that ((63]) (and so (fT6]l ) implies 

all but at most 2c2C^^^2) BC's of V* are coherent. (64) 

Proof. This is similar to the proof of (|6ip . By Lemma l6.4r b). the number 
of incoherent BC's of V* is at most 

s := |{/3 : /3 a BC of P*; /i(/3) < (1 - ci)(*)}|. 

Thus Corollary 16.6( b) and Lemma l6.4r a) give 

h{V*) < /-(2)+3 ^{/i(/3) : /3 a BC of P*} 

< raH((/a) -.)(*) +.(i-ci)a)), 

which with <\63h implies s < 2c2Ci l^^K {> 

For the rest of this argument /3 ranges over BC's (of V*), P and Q over 
triads ofV, and A, B, C over blocks. For each coherent (3 we fix some fp as 
in (j55p and assign an arbitrary (convenient but irrelevant) fp : {blocks} — )• 
{0, 1} to each incoherent /3. 

Say P and Q disagree at a common block A if at least one of P, Q is not 
proper or (both are proper and) Trp{A) / '7Tq{A). (Here one should think of 

39 



P and Q as having just the one block in common; effects due to pairs with 
larger overlap will be insignificant.) We now proceed roughly as follows. An 
averaging argument shows that for most blocks A there are few pairs P, Q 
that disagree at A. When this happens there must be a value for f{A) that 
agrees with most of the triads using A. The remaining few /-values are then 
of no concern and may be assigned arbitrarily. 

To say this properly, write P ^a Q ^i P and Q disagree at A and have 
no other block in common. Write P '^a /3 if -P is a triad of /3 and either 
P is improper or vrp disagrees with /^ at the block A of P, and P •/- /3 if 
P ^A /3 for some block A of P. Setting 

M = \{{(3,P,Q,A) : P,Q triads of /3; P t^a /3 or Q t^a /3}|, 

we have 

M < 2{\')\{{/3,P,A):P^Af3}\ 
< 6{'-')\{{f3,P):P^/3}\ 

where we use d to bound both the fraction of incoherent /3's (see (I64p and 
(j52|) ) and the fraction of triads that disagree with /^ when /3 is coherent. 
But we also have 

m>|{(A,p,Q):P9^aQ}|/(^)"'; 

thus 

Y, \KP,Q) ■■ P ^A Q}\ = |{(AP,Q) : P ^A Q}\ < OiCit'i 



5r6\ 



implying 

\{{P,Q):P7^AQ}\<Vc'it'l^ (65) 

for all but at most 0{y/Cit) A's. 

For A satisfying (j65p we again appeal to Lemma [6.7| applied to the graph 
G = Ga having vertices the triads (of V) that use j4, and PQ an edge if 
P, Q are proper and irp^A) = ttq^A) (so improper triads become isolated 
vertices). We have \V{G)\ = Ql^ and \E(G)\ < y/^tH^+tH^ (the negligible 
second term being a bound on the number of pairs P, Q that share at least 
one additional block); so the lemma says there is some f{A) G {0, 1} such 
that 'Kp{A) = f{A) for all but at most 0{^/Qit^l'^) triads P using A. 
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Finally, extending this / arbitrarily to A's failing (j65p . we find that 
the number of triads (of V) that are improper or disagree with / — so in 
particular the number (needed for (J15p ) that are proper and disagree with 
/ — is less than 0{y/(it^l^); so, in view of ([SOj) . P* is coherent. 



7 Proof of Lemma 13.31 

It will now be convenient to work with triangles rather than triads, which 
we can arrange, e.g., by observing that (jlSp implies 

all but at most 2^2 (3)771^ triangles belong to triads that agree with f (66) 

(by (|13p . since S is much smaller than ^2)- 

We first need to show that / as in (|66p is mostly 1. Say (just for the present 
argument) that a block Vi is "bad" if at least .05 ( 2 )'tT'^ triangles belong to 
triads that disagree with f at Vi. Let M be the number of bad V^'s and N 
the number of pairs {Vi, K) with Vi a block of V* and K a triangle belonging 
to a triad that disagrees with f at Vi. Then 



6(2(3)^^ >N> .05M{ 



\'W 



gives M < 40C2t- 

Suppose, on the other hand, that Vi is good (i.e. not bad). Then the 
number of clauses (of ^) that agree with / at Vi is at least |(.95)( 2 )m^ 
(since each triad P that agrees with / at Vi is proper and thus contributes 
at least \t{P) such clauses), while the number that disagree is at most 
4(.05 + do)(*2^)?7i3. There is thus (since ^(.95) > 4(.05 + do)) some x e Vi 
that belongs to more clauses that agree with f at x than that disagree, so 
that m{x) > m{x) implies that f{Vi) = 1. So we have shown that 

|/-H0)| < 40C2i. 

Now suppose for a contradiction that w is a witness for some C G '^ and 
|w^^(l)| > C,n. Then for the set, say W, of blocks Vi satisfying 

f{Vi) = 1 and |w-i(l) n Vi\ > Cm/2, 



we have 



Cn < |w~^(l)| < 40C2n + |>V|m + Cn/2, 
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whence 

|W| > (C/2 - 40C2)n/m > (C/2 - 40C2)t. 

It then follows from ()66p . using (say) 

(C/2 - 4OC2)' > 3C2, (67) 

that there is some triad P that agrees with /, all three of whose blocks are 
in W (which, note, implies TTp = 1). But then 

(l-8e20(C/2)^>(^(l + 5e2/^) 

(implied by ([57]) ) and ((5, r) -regularity of P imply that there is some C / 
xyz G '^ supported by P, so that w cannot have been a witness. (In more 
detail: Suppose the blocks of P are Vi,Vj,Vk, and let V^ = w~^(l) n Vu- 
Then using Proposition 14.41 (both the upper and lower bounds), we find 
that for the subtriad Q of P spanned (in the obvious sense) by V-,V',Vl, 
we have 

|r(Q)| > (1 - 8e2l)iC/2fm^l-^ > 6(1-^ + 5e2)m^ > 5t{P); 

thus (5, r)-regularity (here r = 1 would suffice) gives d7rp((3) > dT^p — 6, 
implying the existence of xyz as above. 



8 Recursion 

Here we prove (|20p . From this point we write simply X for X„ (the set of 
variables), and use a, b, c, u, v, w, x, y, z for members of X. We call a clause 
positive (negative) if it contains only positive (negative) literals, and non- 
positive if it contains at least one negative literal. We assume throughout 
that all "^'s under discussion belong to X* (and, as usual, that n is large 
enough to support our assertions). 

As the form of (j20p suggests, the proof will proceed by removing from 
I* "i^'s exhibiting various "pathologies," eventually leaving only (a subset of 
all) '^'s containing only positive clauses; these account for the main term, 
2^3), on the right hand side of ([20]) . 

The arguments again involve interplay of a number of small constants, 
and we begin by naming these and specifying what we will assume in the 
way of relations between them. In addition to c (from ()20p ) and (" (from 
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(fT8]) ). we will use constants a, ■(? and ^, assumed to satisfy the (satisfiable) 
relations 

< c < min{e, ^^ - 7H{2C), ^^^ - SH{g/3)} = ^'^ - 7H{2C), (68) 

where g = v2a + C, and 

^ < min{a - 2i?, \/.04 - 2^9 - i?, 0.1 - 7H{2C), 

1 - lH{^) - 0.3 log 7 - 7H{2C + a)} = a - 2d. (69) 

(These hold if all parameters are small and, for example, q > 2^ > 5H('&) 
and '& > 7H{2C).) 

Step 0. Let 

Zl = {io G X* : each variable is used at least ^("2 ) times in ^}. 

Then 

|X* \Xi*| < exp[.8(^)]/(n - 1). (70) 

Proof. There are at most 



n 



E{('^"P):*<1^(V)} < exp[//(^)8(V)] < exp[.8(^)] 



ways to choose a variable x to be used fewer than ^^("2 ) times, together 
with the clauses that use x, and the collection of clauses of ^ not using x is 
an (irredundant) formula on the n — 1 remaining variables. {> 

Step 1. If ^ G X* then for any two variables u, f there are at most Qn 
variables w for which uvijb G ^. The same bound applies to w^s with uvw G 
'^ and those with uvw G '^. 

Proof. If w is a witness for uvW G ^ then any x ^ w with uvx G ^ must lie 
in w~^(l). The other cases are similar. {> 

In particular: 

(a) for any u, 'tf contains at most (n'^ clauses of each of the forms uvw, uvw, 
uvw, uvw; 

(b) '^ contains at most (say) 2Cn^ non-positive clauses; 

(c) if "^ G X|" then, for any u, '^ contains at least (say) 0.02n^ positive 
clauses using u (by (a), since ^ G X* implies m{u) > ^("2 ))■ 
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Step 2. Let Xg consist of those ^ G X^ that satisfy 

for each u, '^ contains at most av? clauses uvw. (71) 

Then 

\Xl \Xl\ < exp[(l - c)Q)] + exp[(l - c)(^)]/(n - 1). (72) 

Proof. We should show that the number of "^'s in I^ violating ()7ip is at 
most the right hand side of ()72p . Given such a '^ we fix u violating ()7ip and 
set y = X \ {u}, 

R = {{a, b}^Y luabe 'T}, B = {{a, b}CY -.uabG '^}, 

S = {a£Y : dR{a) < t?n}, T = {a £ Y : dsia) < M 

(where we regard R and B as graphs on Y and use d for degree) and Z = 
Y\{SUT). 

The main point here is that, because 'rf is irredundant, 

if ab £ R and ac £ B (and b ^ c) then a6c '^. (73) 

Since the number of clauses uvw, which we are assuming to be at least 
an^, is at most (n — |T|)n + irli??! < (I/SI + \Z\)n + 'dn?, we must have either 
\Z\ > i?n or \Z\ < ^n and \S\ > ^n (see 1^ ). 

Suppose first that \Z\ > "dn. In this case, once we have specified Z and 
the R- and i?-edges meeting Z, (i73]l gives at least 'dn-'dn- {'dn — l)/Q positive 
clauses abc that are known to not belong to '^. We may thus (crudely) bound 
the number of possibilities for "^ of this type by the product of the factors: 
n (corresponding to the choice of u); 2" (choose Z); exp[n^] (for the R- 
and -B-edges meeting Z); exp[//(2(") • 7(3))] (for the remaining non-positive 
members of ^ (i.e. those not of the form uvw); here we use (b) of Step 
1); and exp[(l — '(9^) (3)] (for the remaining positive members of '^). This 
product is less than the first term on the right hand side of (j72p . 

Next suppose \Z\ < 'dn and IS*! > ^n. We first observe that n— \S\ can't 
be too small: the number of positive clauses of 'I0 using u is at least 0.02n^ 
(by (c) of Step 1), but also at most \S\-dn + (""2' '), which, after a little 
calculation, gives n — IS"! > \/.04 — 2'!? n. Thus in the present case we must 
have \T\ > (V.04 - 2i? - 'd)n > ^n. 

We may specify a ^ of the present type (i.e. with \Z\ < 'dn and IS"! > S^n, 
so also \T\ > ^n) by choosing: (i) u; (ii) S and T (so also Z); (iii) the i?-edges 
meeting S L) Z and the i3-edges meeting T L) Z; (iv) the i?-edges contained 
in T' := T\S and the 5-edges contained in S' := S\T; and (v) the clauses 
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not involving the variable u. The numbers of choices in (i), (ii) and (v) are 
at most n, 4" and I{n — 1) (respectively), while those for for (iii) and (iv) 
are bounded by 

exp[2^?n2 + {\S\ + \T\)H{^)n + (If') + ('f I)]. 

Combining these bounds with the easy 

('?') + ('?) <(V)-m-on^ 

we find that the number of 'i^'s in question is less than 

n4" exp[(^) - (^(1 -i)-2^- 2H{^))n^]I{n - 1), 
which is less than the second term on the right hand side of (|72p . {> 

Note that "^ G Xg implies (by (a) of Step 1) that for any u, 

'ia contains at most (4^ + 0)11? non-positive clauses using u or u. (74) 

Step 3. For a variable u, set X^ = {{v, w} : uvw G ^} and X^ = ( 2 ) \ 
X^j. Let X| consist of those '^ G X| with the property that for any three 
variables n, v, w, 

each of |Xu n x„ n x^l, |x„nx^nx^|, |x„nx„nx^„| 

and |Xu n X^, n X^t,| is at least 0.1(2)- (75) 

(The "0.1" is just a convenient constant smaller than 1/8.) We assert that 

1X2* \ ^3*1 < exp[(l - c)3(^)]/(n - 3). (76) 

Proof. We may choose "^ G Xg \ X3 by choosing: 
(i) u,v,w violating ([75]) : 

(ii) the non-positive clauses involving at least one of u,v,w; 
(iii) the positive clauses involving u,v,w, 
(iv) the clauses not involving u,v,w. 

The numbers of possibilities for the choices in (i), (ii) and (iv) may be 
bounded by Q, exp[3//((4C + a)n2)/(7(^)) • 7Q] < exp[21H{2C + a)Q] 
(see (j7i|) ) and I{n — 3) respectively. The main point is the bound for the 
number of choices in (iii), which, apart from the 2'-^("' possibilities for clauses 

45 



involving at least two of u, v, w, is bounded by the number of choices for an 
ordered partition of ( \i'^'''''"j^ into eight parts, at least one of which has 
size less than 0.1(2)- We assert (a presumably standard observation) that 
this number is less than 8exp[{H{.l) + .91og7)(2)]. which finishes Step 3 
since the product of the preceding bounds is less than the right hand side 

of dZSl). 

For the assertion, notice that the log of the number of (ordered) parti- 
tions H = Zi U • • • U Zg with \Zi\ < 0.1m is H{Yi, . . . ,Ym) < T.H{Yi), 
where we choose (Zi, . . . , Zg) uniformly from the set of such partitions and 
set Yi = j if i G Zj. (The inequality, an instance of Lemma 16.51 is a 
basic (easy) property of entropy; see e.g. jH Theorem 2.6.6].) Setting 
Pi{j) = Pic{Yi = j) (= Pr(i e Zj)) and pj = m~^ ^iPiiJ), we have 

EH{Y,) = E,E.P.(i)log: 



Piij) 
< m Y^jPj log j- = mH{pi,...,ps) 

(by Jensen's Inequality) and 

HiPi, ...,P8)< H{pi) + (1 - pi) log 7 < HiO.l) + 0.9 log 7 

(using H{X) < log |range(X)| for the first inequality). {> 

Step 4. Let 

X4 = {'t^ G X3 : no clause of "^ uses more than one negative literal.} 

Then 

|X3*\X||<exp[(l-c)(^)]. (77) 

Proof. We first observe that ^ S X| cannot contain a clause with exactly 
two negative literals. For suppose uvw £ '^ . Since "^ G X|, there is some 
pair {a, 6} with ahu^ abv, abw G ^; but this is impossible, since a witness for 
abw must agree with at least one of a6u, o6f , uvw. 

While the preceding argument doesn't quite work to exclude negative 
clauses, the assumption that uvw £ 'rf is extremely restrictive, since it says 
that whenever {a,b} £ X„ n X^ n X^^,, there cannot be any c ^ {u,v,w} 
with abc G ^ (since a witness for abc would have to agree with one of 
abu, abv, abw, uvw). So we may bound the number of "^'s that do contain 
negative clauses by the product of: n^ (choose u, v, w); exp[n^] (choose XuPl 
X„ n X^); exp[7H{2C){^) + 0(n^)] (for clauses that either are non-positive 
or involve u,v or w; here we again use (b)); and exp[("'3 ) — 0.1(2) (n — 
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3)/3] < exp[.9(3)] (for the remaining positive clauses; here the subtracted 
term corresponds to triples known to contain members of X„ n X„ n X^^,). 
And again, the product of these bounds is less than exp[(l — c){^)]. 

Step 5. Finally, we set 

X5 = {'rf G II : '^ contains no clause with exactly one negative literal} 
(so X5 C {^ £ I* : 'rf contains only positive clauses}) and show 

|X4*\X5*|<exp[(^)-cn]. (78) 

Proof. We show that for any t > (by (b) of Step 1 t will be at most C'^^j 
but we don't need this), 

ll*^ G X| : "^ has exactly t non-positive clauses}] < exp[(3) — c'n] (79) 

for a suitable c'; this gives (j78|) for any c < c' . 

Fix t and suppose 'rf is as in (j79p . The main point driving the argument 
(which, however, will take us a while to get to) is: 

if uvw G ^ and a {u, v, w}, then \^ n {auv, avw}\ < 1 (80) 

(since a witness for avw must agree with either auv or uvw). 

Let 'rf' be the set of non-positive clauses in ^. It will be helpful to 
introduce an auxiliary collection: for each C £ 'if' , we will fix an ordering 
of the three literals in C with the negative literal first, and write ^" for the 
resulting collection of ordered triples. We assert that we can do this so that 



\{w■.{u,v,w)£'^"}\<^/a/2n Wu^v. (81) 

This will follow from 

Proposition 8.1. Any (simple) graph admits an orientation with all out- 
degrees at most y/\E{G)\/2. 

Proof (sketch). A precise statement (due to Hakimi [10]; see also [El The- 
orem 61.1, Corollary 61.1b]) is: for any graph G = {V,E) and c : y — )■ N, 
there is an orientation with d^ < Cy W v (where, of course, d~ is the out- 
degree off) iff for every W QV, \E{G[W])\ < J^i'^v '■ v e W}; in particular, 
there is an orientation with d+ < c Vf iff c > max{|i?(G[VF])|/|VF| : W C 
V}, which is easily seen to hold with c = \y^\E{G)\/2 ] . 
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(Alternatively it's easy to see that orienting each edge toward the end of 
larger degree (breaking ties arbitrarily) gives maximum out-degree less than 
Y^2|£'(G)|, which would also be fine for present purposes.) 

To get ([HT]) from Proposition 18 -H regard, for a given n, {vw : uvw £ ^'} 
as the edge set of a graph Gu on X\{u}, and choose an orientation of E{Gu) 
as in the proposition. We have \E{Gu)\ < an^ (by ([71]) ): so interpreting 
orientation of vw toward w as specifying {u,v,w) G 'io' gives (jSTj) . 

Of course there will typically be many choices of ^" as above, and we 
fix one such for each 'lo' . Given 'lo" , set Q = Q{'io') = {{{u,v},{v,w}} : 
{u,v,w) G '^"}. Regard Q as a multigraph on the vertex set (2), and let u 
and T denote its matching and (vertex) cover numbers. Then 

2z. > r > r— 1 (82) 

gn 

(where, recall, g = v2a + (^). Here the first inequality is standard (and 
trivial) and the second follows from the fact that G has t edges and maximum 
degree at most gn, the latter by (fSTj) and Step 1. 

We now consider the number of possibilities for ^ with given a t, r and 
v. We first specify '^' by choosing a vertex cover T for the associated G 
and then a collection of t clauses, each using (the variables from) at least 
one member of 7~. The number of possibilities for these choices is at most 

We now suppose '^' has been determined and consider possibilities for 
the set, say "^o (= '^\^')-, of positive clauses of '^ . Let M. be some maximum 
matching of G-, say M. = {\\ui,Vi\ ,{vi,Wi\\ : i G [i^]}. (We could specify 
UiViWi G ^', but this is now unnecessary.) 

Let J be the set of all pairs of 3-sets {{a,Ui,Vi},{a,Vi,Wi}} such that 
{{ui,Vi} , {vi,Wi\} G M. and a {ui,Vi,Wi}, and let K, be the set of 3-sets 
belonging to pairs in J . Then J is a. set of at least v{n — 3)/2 pairs of 
3-sets (a given pair {{x,y, z}, {x,y, u;}} can arise with x in the role of Vi 
and y in the role of a or vice versa) with the property that no 3-set belongs 
to more than three members of J (since A^ is a matching) ; so in particular 
|/C| > z^(n-3)/3. 

We assert that the number of possibilities for "^o H /C is at most 

exp[(;j)-iK'^-3)(2-log3)]. 

Proof. This is another (somewhat more interesting) application of Lemma 
Let W^ = (3) (thought of as the collection of possible positive clauses); 
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let T be the collection of possible ^o's (compatible with the given ^'); and 
let 'H, consist of all pairs from J (note these are now pairs of elements of W^ 
together with, for each T G W ^ 3 — r\{T^ copies of the singleton {T}, where 
^(7") < 3 is the number of times T appears as a member of some pair in J . 
As noted earlier the key point is (|8U|) . which in the present language says 
that no member of T contains any {5*, T} G J . This implies in particular 
that for each such \^S^T^^ we have |Tr(J^, {5", T})| < 3, so that Lemma [631 
gives 

log|-F| < Hl^r6H/(3-^(r)) + l^|log3] 
< (3)-sK^-3)(2-log3) 

(since ^ r?(r) = 2\J\ and \J\ > u{n - 3)/2). <} 

Finishing the proof of ()78p is now easy. We have shown that the number 
of possibilities for ^ with given t, r and v is at most 

(f)(V)exp[(^)-H^-3)(2-log3)] 



< exp 



^) + {log ^ + 2,nH{e/2,) - ("-3)(2-i°g3) | 



(where we used ()82p (second and first inequalities respectively) for the last 
two terms in the exponent), and summing over r and v shows that the left 
side of dZS]) is less than exp[(!^) -c'n] for any c' < (2-log3)/12-3i?(^/3).<> 

Finally, combining dTO]), (I72|), ([76]), ([77]) and ([78]) (and, of course, the fact 
that II5I < exp [(3)]) gives (f20]) (where we again absorb terms exp[(l — c)(3)] 
from (f72|) and (j77|) in the term exp [(3) — en]). 
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